121 research outputs found

    ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional Networks

    Hash codes are efficient data representations for coping with the ever-growing amounts of data. In this paper, we introduce a random forest semantic hashing scheme that embeds tiny convolutional neural networks (CNNs) into shallow random forests, with near-optimal information-theoretic code aggregation among trees. We start with a simple hashing scheme, where random trees in a forest act as hashing functions by setting '1' for the visited tree leaf and '0' for the rest. We show that traditional random forests fail to generate hashes that preserve the underlying similarity between the trees, making a random forest approach to hashing challenging. To address this, we propose to first randomly group the arriving classes at each tree split node into two groups, obtaining a significantly simplified two-class classification problem that can be handled by a lightweight CNN weak learner. This random class grouping scheme enables code uniqueness by enforcing that each class shares its code with different classes in different trees. A non-conventional low-rank loss is further adopted for the CNN weak learners to encourage code consistency by minimizing intra-class variations and maximizing inter-class distance for the two random class groups. Finally, we introduce an information-theoretic approach for aggregating the codes of individual trees into a single hash code, producing a near-optimal unique hash for each class. The proposed approach significantly outperforms state-of-the-art hashing methods on image retrieval tasks over large-scale public datasets, while performing at the level of state-of-the-art image classification techniques with a more compact, efficient, and scalable representation. This work proposes a principled and robust procedure to train and deploy an ensemble of lightweight CNNs in parallel, instead of simply going deeper. (Comment: Accepted to ECCV 2018)
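
    To make the basic leaf-indicator hashing concrete, the sketch below assigns each sample a per-tree one-hot code over that tree's leaves and concatenates the codes across trees. It is a minimal illustration using an off-the-shelf scikit-learn forest as a stand-in for the paper's CNN-augmented trees; the random class grouping, low-rank loss, and information-theoretic code aggregation are not reproduced here.

```python
# A minimal sketch of the leaf-indicator hashing described above, assuming a
# plain scikit-learn random forest in place of the paper's CNN-augmented trees.
import numpy as np
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier

X, y = load_digits(return_X_y=True)
forest = RandomForestClassifier(n_estimators=8, max_depth=4, random_state=0).fit(X, y)

def leaf_hash(forest, X):
    """Per tree, set '1' for the visited leaf and '0' for the rest, then concatenate."""
    leaf_ids = forest.apply(X)                          # (n_samples, n_trees) leaf node indices
    codes = []
    for t, tree in enumerate(forest.estimators_):
        onehot = np.zeros((X.shape[0], tree.tree_.node_count), dtype=np.uint8)
        onehot[np.arange(X.shape[0]), leaf_ids[:, t]] = 1
        codes.append(onehot)
    return np.hstack(codes)                             # the (un-aggregated) forest hash

H = leaf_hash(forest, X[:5])
print(H.shape, H.sum(axis=1))                           # exactly one active bit per tree
```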

    DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

    In this work we address the task of semantic image segmentation with Deep Learning and make three main contributions that are experimentally shown to have substantial practical merit. First, we highlight convolution with upsampled filters, or 'atrous convolution', as a powerful tool in dense prediction tasks. Atrous convolution allows us to explicitly control the resolution at which feature responses are computed within Deep Convolutional Neural Networks. It also allows us to effectively enlarge the field of view of filters to incorporate larger context without increasing the number of parameters or the amount of computation. Second, we propose atrous spatial pyramid pooling (ASPP) to robustly segment objects at multiple scales. ASPP probes an incoming convolutional feature layer with filters at multiple sampling rates and effective fields of view, thus capturing objects as well as image context at multiple scales. Third, we improve the localization of object boundaries by combining methods from DCNNs and probabilistic graphical models. The commonly deployed combination of max-pooling and downsampling in DCNNs achieves invariance but takes a toll on localization accuracy. We overcome this by combining the responses at the final DCNN layer with a fully connected Conditional Random Field (CRF), which is shown both qualitatively and quantitatively to improve localization performance. Our proposed “DeepLab” system sets a new state of the art on the PASCAL VOC-2012 semantic image segmentation task, reaching 79.7 percent mIOU on the test set, and advances the results on three other datasets: PASCAL-Context, PASCAL-Person-Part, and Cityscapes. All of our code is made publicly available online.
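
    As a concrete illustration of atrous convolution and ASPP-style multi-rate pooling, the following sketch builds parallel dilated 3x3 convolutions with PyTorch's dilation argument; the channel counts and sampling rates are illustrative assumptions rather than the paper's exact configuration.

```python
# A minimal sketch of atrous convolution and an ASPP-style block in PyTorch.
# Channel counts and sampling rates are illustrative assumptions.
import torch
import torch.nn as nn

class ASPPSketch(nn.Module):
    def __init__(self, in_ch=256, out_ch=256, rates=(6, 12, 18, 24)):
        super().__init__()
        # Parallel 3x3 atrous convolutions; padding = rate keeps the spatial size,
        # while the dilation enlarges the effective field of view at no extra cost
        # in parameters.
        self.branches = nn.ModuleList(
            [nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=r, dilation=r)
             for r in rates]
        )

    def forward(self, x):
        # Fuse responses computed at multiple sampling rates / fields of view.
        return sum(branch(x) for branch in self.branches)

feat = torch.randn(1, 256, 33, 33)        # a dense DCNN feature map
print(ASPPSketch()(feat).shape)           # torch.Size([1, 256, 33, 33])
```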

    Cycle-based Cluster Variational Method for Direct and Inverse Inference

    We elaborate on the idea that loop corrections to belief propagation can be dealt with in a systematic way on pairwise Markov random fields by using the elements of a cycle basis to define regions in a generalized belief propagation setting. The region graph is specified in such a way as to avoid dual loops as much as possible, by discarding redundant Lagrange multipliers, in order to facilitate convergence while avoiding the instabilities associated with a minimal factor graph construction. We end up with a two-level algorithm, in which a belief propagation algorithm is run alternately at the level of each cycle and at the inter-region level. The inverse problem of finding the couplings of a Markov random field from empirical covariances can be addressed region-wise. It turns out that this can be done efficiently, in particular in the Ising context, where fixed-point equations can be derived along with a one-parameter log-likelihood function to minimize. Numerical experiments confirm the effectiveness of these considerations for both the direct and inverse MRF inference. (Comment: 47 pages, 16 figures)
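
    For context, the sketch below implements plain single-level loopy belief propagation on a pairwise binary (Ising) MRF, the baseline that the cycle-based region construction refines; the two-level cycle/inter-region updates and the inverse (coupling estimation) procedure are not implemented.

```python
# A minimal sketch of single-level loopy belief propagation on a pairwise
# binary (Ising) MRF, the substrate the cycle-based region construction builds on.
import numpy as np

def loopy_bp(J, h, iters=200, damping=0.5):
    """J: symmetric couplings (zero diagonal), h: local fields; returns magnetizations."""
    n = len(h)
    m = np.zeros((n, n))                         # cavity field m[i, j] sent from i to j
    for _ in range(iters):
        for i in range(n):
            for j in range(n):
                if i == j or J[i, j] == 0:
                    continue
                cavity = h[i] + sum(
                    np.arctanh(np.tanh(J[k, i]) * np.tanh(m[k, i]))
                    for k in range(n) if k != j and J[k, i] != 0)
                m[i, j] = damping * m[i, j] + (1 - damping) * cavity
    full = h + np.array([
        sum(np.arctanh(np.tanh(J[k, i]) * np.tanh(m[k, i]))
            for k in range(n) if J[k, i] != 0)
        for i in range(n)])
    return np.tanh(full)

J = 0.3 * (np.ones((3, 3)) - np.eye(3))          # a single 3-cycle
print(loopy_bp(J, h=np.array([0.1, 0.0, -0.1])))
```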

    Truncated Inference for Latent Variable Optimization Problems: Application to Robust Estimation and Learning

    Optimization problems with an auxiliary latent variable structure in addition to the main model parameters occur frequently in computer vision and machine learning. The additional latent variables make the underlying optimization task expensive, either in terms of memory (by maintaining the latent variables) or in terms of runtime (repeated exact inference of latent variables). We aim to remove the need to maintain the latent variables and propose two formally justified methods that dynamically adapt the required accuracy of latent variable inference. These methods have applications in large-scale robust estimation and in learning energy-based models from labeled data. (Comment: 16 pages)
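
    As a rough illustration of the latent-variable structure in robust estimation, the sketch below lifts a robust loss with per-residual weights (the latent variables) and, rather than maintaining them or re-solving the inner problem exactly, recomputes them on the fly and takes a single parameter step per iteration. It is a generic example under assumed constants, not the paper's formally justified adaptive-accuracy methods.

```python
# A generic sketch of robust estimation with an implicit latent variable: each
# residual carries a weight (the latent variable), which is recomputed on the
# fly instead of being stored or inferred exactly, and the parameters take a
# single (truncated) step per iteration. Constants are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
X = np.c_[np.ones(100), rng.uniform(-1, 1, 100)]
y = X @ np.array([1.0, 2.0]) + 0.05 * rng.normal(size=100)
y[:10] += 5.0                                    # gross outliers

def robust_fit(X, y, steps=500, lr=0.5, tau=1.0):
    theta = np.zeros(X.shape[1])
    for _ in range(steps):
        r = X @ theta - y
        w = 1.0 / (1.0 + (r / tau) ** 2)         # latent per-residual weights (recomputed, not stored)
        theta -= lr * (X.T @ (w * r)) / len(y)   # one truncated parameter update
    return theta

print(robust_fit(X, y))                          # roughly [1., 2.] despite the outliers
```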

    Supergravity Solutions from Floating Branes

    We solve the equations of motion of five-dimensional ungauged supergravity coupled to three U(1) gauge fields using a floating-brane Ansatz in which the electric potentials are directly related to the gravitational warp factors. We find a new class of non-BPS solutions that can be obtained linearly, starting from a Euclidean four-dimensional Einstein-Maxwell base. This class, the largest known so far, reduces to the BPS and almost-BPS solutions in certain limits. We solve the equations explicitly when the base space is given by the Israel-Wilson metric, and obtain solutions describing non-BPS D6 and anti-D6 branes kept in equilibrium by flux. We also examine the action of spectral flow on solutions with an Israel-Wilson base and show that it relates these solutions to almost-BPS solutions with a Gibbons-Hawking base. (Comment: 24 pages, 1 figure)
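
    For orientation, the floating-brane Ansatz ties the electric potentials directly to the warp factors of the standard three-charge metric. The schematic form below follows conventions commonly used in this literature; the exact signs, normalizations, and scalar sector may differ from the paper.

```latex
% Schematic three-charge floating-brane Ansatz (conventions are assumptions;
% see the paper for exact signs and normalizations).
\begin{align}
  ds_5^2 &= -Z^{-2}\,(dt + k)^2 + Z\, ds_4^2, \qquad Z \equiv \left(Z_1 Z_2 Z_3\right)^{1/3},\\
  A^I    &= -\epsilon\, Z_I^{-1}\,(dt + k) + B^{(I)}, \qquad I = 1, 2, 3,
\end{align}
% with ds_4^2 a four-dimensional Euclidean (here Einstein-Maxwell) base metric.
```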

    Feedforward Inhibition and Synaptic Scaling – Two Sides of the Same Coin?

    Feedforward inhibition and synaptic scaling are important adaptive processes that control the total input a neuron can receive from its afferents. While often studied in isolation, the two have been reported to co-occur in various brain regions. The functional implications of their interaction remain unclear, however. Based on a probabilistic modeling approach, we show here that fast feedforward inhibition and synaptic scaling interact synergistically during unsupervised learning. In technical terms, we model the input to a neural circuit using a normalized mixture model with Poisson noise. We demonstrate analytically and numerically that, in the presence of lateral inhibition introducing competition between different neurons, Hebbian plasticity and synaptic scaling approximate the optimal maximum likelihood solutions for this model. Our results suggest that, beyond its conventional use as a mechanism to remove undesired pattern variations, input normalization can make typical neural interaction and learning rules optimal on the stimulus subspace defined through feedforward inhibition. Furthermore, learning within this subspace is more efficient in practice, as it helps avoid locally optimal solutions. Taken together, these findings point to a close connection between feedforward inhibition and synaptic scaling, which may have important functional implications for general cortical processing.
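
    The sketch below illustrates the kind of generative setup the abstract describes: inputs treated as a normalized mixture with Poisson noise, a softmax competition standing in for lateral inhibition, and a Hebbian-style update followed by a divisive synaptic-scaling step. The specific update rules and constants are illustrative assumptions, not the paper's derivation.

```python
# A toy version of a normalized Poisson mixture learned with a Hebbian-style
# rule plus divisive synaptic scaling; lateral inhibition appears as a softmax
# over hidden units. All constants and update rules are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
D, K, A = 16, 4, 50.0                              # input dim, hidden units, input budget
true_W = rng.dirichlet(np.ones(D), size=K)         # ground-truth components (rows sum to 1)

def sample_batch(n):
    z = rng.integers(K, size=n)
    return rng.poisson(A * true_W[z]).astype(float)

W = rng.dirichlet(np.ones(D), size=K)              # learned weights, row-normalized
lr = 0.05
for _ in range(2000):
    y = sample_batch(32)
    y = A * y / y.sum(axis=1, keepdims=True)       # feedforward-inhibition-like normalization
    log_post = y @ np.log(W.T + 1e-12)             # Poisson log-likelihood up to constants
    post = np.exp(log_post - log_post.max(axis=1, keepdims=True))
    post /= post.sum(axis=1, keepdims=True)        # softmax competition (lateral inhibition)
    W += lr * (post.T @ y / A - post.sum(axis=0)[:, None] * W)   # Hebbian-style update
    W = np.clip(W, 1e-12, None)
    W /= W.sum(axis=1, keepdims=True)              # synaptic scaling keeps each row normalized

print(np.round(W, 2))                              # rows typically recover true_W up to permutation
```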

    Haptic Edge Detection Through Shear

    Most tactile sensors are based on the assumption that touch depends on measuring pressure. However, the pressure distribution at the surface of a tactile sensor cannot be acquired directly and must be inferred from the deformation field induced by the touched object in the sensor medium. Currently, there is no consensus as to which components of strain are most informative for tactile sensing. Here, we propose that shape-related tactile information is more suitably recovered from shear strain than from normal strain. Based on a contact mechanics analysis, we demonstrate that the elastic behavior of a haptic probe provides a robust edge detection mechanism when shear strain is sensed. We used a jamming-based robot gripper as a tactile sensor to empirically validate that shear strain processing gives accurate edge information that is invariant to changes in pressure, as predicted by the contact mechanics study. This result has implications for the design of effective tactile sensors as well as for understanding early somatosensory processing in mammals.
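
    The sketch below gives a toy numerical version of the argument: the normal displacement of a soft sensor under a flat object with a straight edge is approximated by a smoothed step, and the shear-like signal (the lateral gradient of that displacement) localizes the edge and, once normalized, is essentially invariant to the contact pressure. The Gaussian smoothing is a crude stand-in for the paper's contact-mechanics analysis.

```python
# A toy numerical illustration: a smoothed step approximates the sensor's
# normal displacement under a flat object with a straight edge; its lateral
# gradient (a shear-like signal) localizes the edge and, after peak
# normalization, is essentially invariant to contact pressure.
import numpy as np

x = np.linspace(-10, 10, 401)                     # lateral position (arbitrary units)
edge_profile = (x < 0).astype(float)              # the object covers x < 0

def displacement(pressure, blur=1.5):
    kernel = np.exp(-0.5 * (x / blur) ** 2)       # elastic smoothing of the footprint
    kernel /= kernel.sum()
    return pressure * np.convolve(edge_profile, kernel, mode="same")

for p in (1.0, 3.0):                              # two contact pressures
    u = displacement(p)                           # pressure-like (normal) signal
    shear = np.gradient(u, x)                     # shear-like (lateral gradient) signal
    edge_location = x[np.argmax(np.abs(shear))]
    # The edge is recovered near x = 0, and the pressure-normalized shear peak
    # is identical for both pressures.
    print(p, round(edge_location, 2), round(np.abs(shear).max() / p, 3))
```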

    Population based allele frequencies of disease associated polymorphisms in the Personalized Medicine Research Project

    Background: There is a lack of knowledge regarding the frequency of disease-associated polymorphisms in populations, and the population attributable risk for many populations remains unknown. Factors that could affect the association of an allele with disease, either positively or negatively, such as race, ethnicity, and gender, may not be possible to determine without population-based allele frequencies. Here we used a panel of 51 polymorphisms previously associated with at least one disease and determined the allele frequencies within the entire Personalized Medicine Research Project population-based cohort. We compared these allele frequencies to those in dbSNP and other data sources stratified by race. Differences in allele frequencies between self-reported race, region of origin, and sex were determined.
    Results: There were 19,544 individuals who self-reported a single racial category, 19,027 (97.4%) self-reported white Caucasian, and 11,205 (57.3%) individuals were female. Of the 11,208 (57%) individuals with an identifiable region of origin, 8,337 (74.4%) were German. Forty-one polymorphisms were significantly different between self-reported races at the 0.05 level. Stratification of our Caucasian population by self-reported region of origin revealed 19 polymorphisms that were significantly different (p = 0.05) between individuals of different origins. Further stratification of the population by gender revealed few significant differences in allele frequencies between the genders.
    Conclusions: This represents one of the largest population-based allele frequency studies to date. Stratification by self-reported race and region of origin revealed wide differences in allele frequencies not only by race but also by region of origin within a single racial group. We report allele frequencies for our Asian/Hmong and American Indian populations; these two minority groups are not typically selected for population allele frequency detection. Population-wide allele frequencies are important for the design and implementation of studies and for determining the relevance of a disease-associated polymorphism for a given population.
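
    A minimal sketch of the kind of per-polymorphism comparison reported here: allele frequencies are estimated from genotype counts in two self-reported groups and compared with a chi-square test on the allele-count table. The genotype counts below are made up for illustration.

```python
# A minimal sketch of the per-polymorphism comparison: estimate minor allele
# frequencies from genotype counts in two self-reported groups and test the
# difference with a chi-square test on the 2x2 allele-count table.
import numpy as np
from scipy.stats import chi2_contingency

def allele_counts(n_AA, n_Aa, n_aa):
    """Return (minor, major) allele counts from genotype counts."""
    return np.array([2 * n_aa + n_Aa, 2 * n_AA + n_Aa])

group1 = allele_counts(n_AA=700, n_Aa=250, n_aa=50)    # hypothetical group 1
group2 = allele_counts(n_AA=520, n_Aa=380, n_aa=100)   # hypothetical group 2

maf1 = group1[0] / group1.sum()
maf2 = group2[0] / group2.sum()
chi2, p, _, _ = chi2_contingency(np.vstack([group1, group2]))
print(f"minor allele frequency: {maf1:.3f} vs {maf2:.3f}, p = {p:.1e}")
```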

    Enhanced protein fold recognition through a novel data integration approach

    Background: Protein fold recognition is a key step in protein three-dimensional (3D) structure discovery. There are multiple fold-discriminatory data sources, which use physicochemical and structural properties as well as further data sources derived from local sequence alignments. This raises the issue of finding the most efficient method for combining these different informative data sources and exploring their relative significance for protein fold classification. Kernel methods have been extensively used for biological data analysis. They can incorporate separate fold-discriminatory features into kernel matrices which encode the similarity between samples in their respective data sources.
    Results: In this paper we consider the problem of integrating multiple data sources using a kernel-based approach. We propose a novel information-theoretic approach based on a Kullback-Leibler (KL) divergence between the output kernel matrix and the input kernel matrix so as to integrate heterogeneous data sources. One of the most appealing properties of this approach is that it can easily cope with multi-class classification and multi-task learning by an appropriate choice of the output kernel matrix. Based on the position of the output and input kernel matrices in the KL-divergence objective, there are two formulations, which we refer to as MKLdiv-dc and MKLdiv-conv, respectively. We propose to solve MKLdiv-dc efficiently by a difference of convex (DC) programming method and MKLdiv-conv by a projected gradient descent algorithm. The effectiveness of the proposed approaches is evaluated on a benchmark dataset for protein fold recognition and a yeast protein function prediction problem.
    Conclusion: Our proposed methods MKLdiv-dc and MKLdiv-conv achieve state-of-the-art performance on the SCOP PDB-40D benchmark dataset for protein fold prediction and provide useful insights into the relative significance of informative data sources. In particular, MKLdiv-dc further improves the fold discrimination accuracy to 75.19%, a more than 5% improvement over competitive Bayesian probabilistic and SVM margin-based kernel learning methods. Furthermore, we report competitive performance on the yeast protein function prediction problem.
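
    The sketch below illustrates the integration idea at a high level: several input kernel matrices are combined with nonnegative weights, and the weights are chosen to minimize a KL divergence between zero-mean Gaussians defined by the output (label) kernel and the combined input kernel. The numerical-gradient projected step is a generic illustration, not the paper's MKLdiv-dc or MKLdiv-conv algorithms.

```python
# A rough sketch of KL-divergence-based kernel integration: input kernels are
# combined with nonnegative weights, tuned so that the Gaussian defined by the
# combined kernel matches the one defined by the output (label) kernel.
# Numerical gradients and clip-and-renormalize projection are generic choices.
import numpy as np

rng = np.random.default_rng(0)
n = 60
labels = rng.integers(0, 3, size=n)
K_out = (labels[:, None] == labels[None, :]).astype(float)   # output (label) kernel

def rbf(X, gamma=1.0):
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

# Three heterogeneous "data sources"; only the first two carry label information.
sources = [labels[:, None] + 0.3 * rng.normal(size=(n, 2)),
           labels[:, None] + 0.8 * rng.normal(size=(n, 2)),
           rng.normal(size=(n, 2))]
kernels = [rbf(X) for X in sources]

def kl_gaussian(K1, K2, lam=0.1):
    """KL( N(0, K1 + lam I) || N(0, K2 + lam I) )."""
    A, B = K1 + lam * np.eye(n), K2 + lam * np.eye(n)
    return 0.5 * (np.trace(np.linalg.solve(B, A)) - n
                  + np.linalg.slogdet(B)[1] - np.linalg.slogdet(A)[1])

def objective(w):
    return kl_gaussian(K_out, sum(wi * Ki for wi, Ki in zip(w, kernels)))

w = np.full(len(kernels), 1.0 / len(kernels))
for _ in range(200):                                          # projected, normalized gradient steps
    grad = np.array([(objective(w + 1e-5 * e) - objective(w)) / 1e-5
                     for e in np.eye(len(kernels))])
    w = np.clip(w - 0.05 * grad / (np.linalg.norm(grad) + 1e-12), 1e-6, None)
    w /= w.sum()                                              # stay on the simplex

print(np.round(w, 3))    # weights typically shift toward the informative sources
```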